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Interactive Schemes for the AWGN Channel with 

Noisy Feedback 

Assaf Ben-Yishai and Ofer Shayevitz 


Abstract —We study the problem of communication over an 
additive white Gaussian noise (AWGN) channel with an AWGN 
feedback channel. When the feedback channel is noiseless, the 
classic Schalkwijk-Kallath (S-K) scheme is known to achieve 
capacity in a simple sequential fashion, while attaining reliability 
superior to non-feedback schemes. In this work, we show how 
simplicity and reliability can be attained even when the feedback 
is noisy, provided that the feedback channel is sufficiently better 
than the feedforward channel. Specifically, we introduce a low- 
complexity low-delay interactive scheme that operates close to 
capacity for a fixed bit error probability (e.g. 10“®). We then 
build on this scheme to provide two asymptotic constructions, 
one based on high dimensional lattices, and the other based on 
concatenated coding, that admit an error exponent significantly 
exceeding the best possible non-feedback exponent. Our approach 
is based on the interpretation of feedback transmission as a side- 
information problem, and employs an interactive modulo-lattice 
solution. 

I. Introduction 

While feedback cannot increase the capacity of point-to- 
point memoryless channels |[T|, there exist noiseless feedback 
communication schemes that can provide a significant im¬ 
provement in terms of simplicity and reliability, see e.g. m- 
0. However, these elegant feedback schemes completely fail 
in the presence of arbitrarily low feedback noise, rendering 
them grossly impractical. This naturally raises the question of 
whether simplicity and reliability can still be achieved to some 
degree in a practical setup of noisy feedback. In this paper, 
we address this question in a Gaussian setting and answer it 
in the affirmative. 

The setup we consider is the following. Two Terminals A 
and B are connected by a pair of independent AWGN channels, 
and are limited by individual power constraints. The channel 
from Terminal A (resp. B) to Terminal B (resp. A) is referred 
to as the feedforward (resp. feedback) channel. Terminal A 
is in possession of a message to be reliably transmitted to 
Terminal B. To that end, an interactive communication model 

The authors are with the Department of EE-Systems, Tel Aviv University, 
Tel Aviv, Israel {assafbster@gmail.com, ofersha@eng.tau.ac.il}. The work of 
A. Ben-Yishai was partially supported by an ISF grant no. 1367/14. The work 
of O. Shayevitz was supported by an ERC grant no. 639573, a CIG grant no. 
631983, and an ISF grant no. 1367/14. This paper was presented in part at 
Allerton 2014 and ISIT 2015. 


is adopted where both terminals are allowed to employ coding 
and exchange signals on the fly. This model is sometimes 
referred to as active feedback, and should be distinguished 
from the passive feedback setting where no coding is allowed 
over the feedback channel. The AWGN channel with noiseless 
feedback was studied in the classical works of Schalkwijk 
and Kailath 0, 0, who introduced a capacity-achieving 
communication scheme referred to herein as the S-K Scheme. 
This linear-feedback coding scheme employs a first-order 
recursion at both terminals, and is markedly simpler than its 
non-feedback counterparts that typically employ long block 
codes and complex encoding/decoding techniques. In terms 
of reliability, the error probability attained by the S-K scheme 
decays super-exponentially with the delay, in contrast to the 
weaker exponential decay achieved by non-feedback codes ||6l. 
However, this scheme and its linear feedback generalizations 
are not robust to any amount of feedback noise, as was initially 
observed in 0 and further strengthened in 0. 

The main contribution of this work is in showing that to 
some extent, the merits of noiseless feedback can be carried 
over to the practical regime of noisy feedback. Contrary to 
the noiseless feedback case, these improvements in simplicity 
and reliability are not simultaneously achieved. In terms of 
reliability, we construct two interactive protocols that are 
of comparable complexity to non-feedback schemes, but are 
superior in the asymptotic error exponent sense. In terms of 
simplicity, we depart from the standard asymptotic regime and 
show how a fixed (but low) error probability can be attained at 
a small capacity gap, where the latter term refers to the amount 
of excess SNR required by the scheme above the minimum 
predicted by the Shannon limit. Both these constructions are 
useful when the signal-to-noise ratio of the feedback channel 
sufficiently exceeds that of the feedforward channel. 

As a case in point, consider the high-SNR regime and 
assume that the SNR of the feedback channel exceeds the SNR 
of the feedforward channel by 20dB. Then our simplicity- 
oriented scheme operates at a capacity gap of merely O.SdB 
with only 19 rounds of interaction, and attains a bit error 
rate of 10“®. This should be juxtaposed against two reference 
systems, operating at the same bit error rate; On the one 
hand, state-of-the-art non-feedback codes that attain the same 
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capacity gap require roughly two orders of magnitude increase 
in delay and complexity. On the other hand, the capacity gap 
attained by a minimal delay uncoded system is at least 9dB. 
Finally, under the same setup, our reliability-oriented schemes 
attain an error exponent exceeding the sphere-packing bound 
of the feedforward channel for a wide range of rates below 
capacity. 

The construction we introduce is based on endowing the S- 
K scheme with modulo-lattice operations. As observed in lO, 
the feedforward S-K scheme can be interpreted as a solution 
to a Joint Source-Channel Coding (JSCC) problem via analog 
transmission. Here, we further observe that the feedback trans¬ 
mission problem can be cast as a similar problem but with side 
information (i.e. the message) at the receiver (i.e.. Terminal 
A). This observation is crucial for our construction, and is 
leveraged by means of modulo-lattice analog transmission in 
the spirit of Kochman and Zamir 0. 

Let us briefly describe the simplicity-oriented version of our 
scheme. Terminal A encodes its message into a scalar 0 using 
pulse amplitude modulation (PAM). In subsequent rounds. 
Terminal B computes a linear estimate of 0, and feeds back 
an exponentially amplified version of this estimate, modulo a 
fixed interval. The modulo operation facilitates the essential 
“zoom-in” amplification without exceeding the power limit, at 
the cost of a possible modulo-aliasing error. In turn. Terminal 
A employs a suitable modulo computation and obtains (if 
no modulo-aliasing occurs) the estimation error, corrupted by 
excess additive noise. This quantity is then properly scaled and 
sent over the feedforward channel to Terminal B. After a fixed 
number of rounds. Terminal B decodes the message using a 
minimum distance rule. Loosely speaking, the scheme’s error 
probability is dictated by the events of a modulo-aliasing in 
any of the rounds, as well as the event where the remaining 
estimation noise is larger than the minimum distance of the 
PAM. 

We also introduce two asymptotic reliability-oriented ver¬ 
sions of our scheme. The first is based on an asymptotic 
generalization of the simple interaction idea above, where a 
block code replaces the PAM modulation, and a block S- 
K scheme is used in conjunction with a multi-dimensional 
modulo-lattice operation, replacing the scalar modulo. We pro¬ 
vide an asymptotic error exponent analysis using the Poltyrev 
exponent to account for modulo-aliasing errors, and channel 
coding exponents to account for the error of the block code. 
The second scheme we present is based on concatenated 
coding, with the scalar simplicity-oriented scheme as an inner 
code and a block outer code. Since the discrete memoryless 
channel (DMC) induced by the inner code and viewed by the 
outer code is non-Gaussian, we give a lower bound for the 
error exponent based on the performance of a “worst case” 


symmetric DMC. 

Related work. In Q, m, the authors analyzed the relia¬ 
bility function of the AWGN channel at zero rate for noisy 
passive feedback, i.e. where the channel outputs are fed back 
without any processing. In CD , the authors gave an interesting 
analysis of the reliability of transmission of an M-ary message 
(M > 3) over AWGN with passive noisy feedback, but in 
a slightly different setting where a peak energy constraint 
is imposed. In CD, the authors considered a concatenated 
coding scheme with a passive linear-feedback inner code 
and a block outer code, and provided some error exponent 
results. In Section IVI-GI we compare our reliability-oriented 
scheme to m, and show that the exponent obtained in ifT^ 
is better for low rates whereas our exponent is better for 
high rates. In ifTSl . which is closer to our interactive setting, 
the reliability function associated with the transmission of a 
single bit over an AWGN channel with noisy active feedback 
has been considered. Specifically, it was shown that active 
feedback roughly quadruples the error exponent relative to 
passive feedback. The achievability result of ifTSl is better than 
ours at zero rate but does not extend to positive rates. 

Organization. Notation and definitions are given in Sec¬ 
tion HI] The problem setup is introduced in Section Hnj Neces¬ 
sary background is given in Section |IV| Simple interaction is 
addressed in Section |Vl and improving reliability is addressed 
in Section IVll 

IT Notation AND Definitions 

In the sequel, we use the following notation. For any number 

def 

a; > 0, we write XdB = 101og]^o(®) to denote the value of x 
in decibels. The Gaussian Q-function is 
1 

Q (x) — / exp (—m^/2) du 

V ^TT JX 

and is its functional inverse. We write f{x) = 0{g{x)) 

for limsup 2 ,^o ,3 \f{x)/g{x)\ < oo. We write log for base 2 
logarithm, and In for the natural logarithm. We use the vector 
notation x" = (xi,...,Xn) and boldface letters such as x 
to indicate vectors of size N\. We write a„ > to mean 
liminf„_>oo ^ In > 0, and similarly define < and =. 

We use A to denote the complementary of an event A. 

The Capacity Gap. Recall that the Shannon capacity of the 
AWGN channel with signal-to-noise ratio SNR is given by 

ilog(l + SNR). (1) 

This is the maximal rate achievable by any scheme (of 
unbounded complexity/delay, with or without feedback) under 
vanishing error probability. Conversely, the minimal SNR 
required to reliably attain a rate R is 2^^ — 1. The capacity 
gap r attained by a coding scheme that operates at rate R over 
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Fig. 1. Block diagram of interactive coding over an AWGN channel with 
noisy feedback 


an AWGN channel, is the excess SNR required by the scheme 
over the minimum predicted by the Shannon limit, i.e.. 


def SNR 
- 22fl - 1 ■ 


( 2 ) 


Note that if a nonzero bit/symbol error probability is allowed, 
then one can achieve rates exceeding the Shannon capacity 
O, and this effect should in principle be accounted for, to 
make the definition of the capacity gap fair. However, for small 
error probabilities the associated correction factor (related to 
the corresponding rate-distortion function) becomes negligible, 
and we therefore ignore it in the sequel. 


III. Setup 

Our problem setup is depicted in Fig. [T] The feedforward 
and feedback channels connecting Terminal A to Terminal B 
and vice versa, are AWGN channels given by 


past feedback channel outputs over the feedforward channel, 
i.e.. 

Similarly, Terminal B sends function of its past observations 
to Terminal A over the feedback channel, i.e.. 


Remark 1: In general, we allow these functions to further 
depend on common randomness shared by the terminals. We 
note in passing that our definition of the feedback transmission 
scheme is sometimes referred to as active feedback, the term 
passive feedback is usually reserved to the special case where 

^(V") = r„. 

The number of rounds N is fixed. While feedback protocols 
with variable transmission length exist and can improve reli¬ 
ability relative to non-feedback transmission El, im, they 
are beyond the scope of this work. We assume that Terminal 
A (resp. Terminal B) is subject to an average power constraint 
P (resp. P), namely 

N N 

Y.m'i)<N-p, Y. E{X^) <N-P. 

n—1 n—1 

We denote the feedforward (resp. feedback) signal-to-noise 
ratio by SNR ^ (resp. SNR ^). The excess signal- 
to-noise ratio of the feedback over the feedforward is denoted 
by ASNR §§§■ Throughout this work, we assume that 
ASNR > 1. 

An interactive scheme {ip, ip) is associated with a rate R = 
and an error probability pe, which is the probability that 
Terminal B errs in decoding the message W at time N, under 
the optimal decision rule. 

IV. Preliminaries 


V _ Y 

^ n. — n 


Y„ — Xn + Zn 


where Xn,Yn (resp. Xn,Yn) are the input and output of 
the feedforward (resp. feedback) channel at time n respec¬ 
tively. The feedforward (resp. feedback) channel noise ^ 
(resp. Zn ^ is independent of the input 

Xn (resp. Xn), and constitutes an i.i.d. sequence. The feedfor¬ 
ward and feedback noise processes are mutually independent. 

Terminal A is in possession of a message W, uniformly 
distributed over the set {0, ...,M — 1}, to be described to 
Terminal B over N rounds of communication. To that end, 
the terminals can employ an interactive scheme defined by 
the sequences of functions (p„ and as follows: At time n. 
Terminal A sends a function of its message W and possibly of 


In this section, we describe the building blocks underlying 
our interactive scheme. First, we review the use of uncoded 
PAM signaling, and discuss its associated capacity gap. Then, 
we describe the basic problem of joint source-channel coding 
(JSCC) via analog transmission, and show how to build the 
S-K scheme from uncoded PAM and iterative JSCC. Lastly, 
we discuss the problem of JSCC with side information using 
modulo arithmetic, and present a simple scalar solution that is 
later implemented as part of our simplicity-oriented scheme. 

A. Uncoded PAM 

PAM is a simple and commonly used modulation scheme, 
where 2^ symbols are mapped (one-to-one) to the set 

{±lp,±3p,--- ,±(2«-l)p}. 
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Canonically, the normalization factor 77 is set so that the 
overall mean square of the constellation (assuming equiprob- 
able symbols) is unity. A straightforward calculation yields 
rj = y^3/ {2"^^ — 1). In the general case where the mean 
square of the constellation is constrained to be P, rj is replaced 
with rj^/P. 

It is easy to show that for an AWGN channel with zero 
mean noise of variance and average input power constraint 
P, the probability of error incurred by the optimal detector is 
given by the following formula 


Manipulating Q yields; 



which can be slightly relaxed to obtain a lower bound on R 
by 


1, / SNR 


(5) 


where 



Comparing © and O, we see that PAM signaling with 
error probability pe admits a capacity gap of at most ro(pe)- 
Comparing © and © it is clear that this upper bound is tight 
as R increases. For a typical value of pe = 10“®, the capacity 
gap of uncoded PAM is at most Fo^dB = 9dB. 

Finally, we assume as usual that bits are mapped to PAM 
constellation symbols via Gray labeling. The associated bit 
error probability can thus be bounded by 



Q 



+ 2Q 3 



El 

R' 


(7) 


The bound follows by noting that erring toward the nearest 
neighbor incurs an error in a single bit, and by taking a worst 
case assumption for all other error events. The approximation 
becomes tight for small pe due to the strong decay of the 
Q-function. 


B. Joint Source-Channel-Coding (JSCC) via Analog Trans¬ 
mission 

It is well known in that when a Gaussian source is to 
be transmitted over an AWGN channel under a quadratic 
distortion measure, analog transmission obtains the optimal 
distortion (given by equating the rate-distortion function to 


the channel capacity) with minimal delay. This solution is a 
simple instance of joint source-channel coding (JSCC). More 
explicitly, we wish to convey a Gaussian r.v. e ^ A/'(0,tTg) 
over an AWGN channel Y = X -\- Z, Z ^ N cr^), with 
expected input power constraint < P (i.e. SNR = J 7 ). 
The optimal transmission and estimation boil down to X = ae 
and 'e = jJY. The optimal choice of a yields a power scaling 


ZE 

CTe ’ 

Wiener coefficient 6 = 


factor, i.e. a = and optimal choice of j3 yields the 


/SNR 


^SNr+T- Plugging a and 

the minimal attainable MSE in this setup: 

^2 


E(£ — s) = 


cr; 


SNR-f 1 


yields 


( 8 ) 


Namely, this JSCC scheme improves the estimation error of e 
by a factor SNR -f 1 relative to a trivial guess. In the sequel 
we shall use this simple construction as a building block for 
both the classic S-K scheme and the newly proposed noisy 
feedback schemes. 


C. The S-K Scheme 

Consider the setting of communication over the AWGN 
channel with noiseless feedback, i.e., where = 0. The S-K 
scheme can be described as follows. First, Terminal A maps 
the message W to the real-valued variable 0 using a PAM 
modulation of size 2^^. In the first round, it sends a scaled 
version of 0 satisfying the power constraint P. In subsequent 
rounds. Terminal B maintains an estimate 0„ of 0 given 
all the observation it has, and feeds it back to Terminal A. 

def 

Terminal A then computes the estimation error e„ = 0„ — 0, 
and sends it to Terminal B using analog transmission. 

(A) Initialization; 

Terminal A: Map the message FF to a PAM point 0. 
Terminal A ^ Terminal B: 

. Send Xi = y/PS 

• Receive Yi = Xi -\- Zi 

Terminal B: Initialize the 0 estimat^ to 0i = 

(B) Iteration: 

Terminal B Terminal A: 

• Send the current 0 estimate: = 0„ 

• Receive 

Terminal A: Compute the estimation error = Yn — Q. 

Terminal A => Terminal B: 

• Send e„ via analog transmission, i.e. Xn+i = otnSn, 

where where cr^ Ee^. 

. Receive F„+i = X^+i + ^„+i 

Terminal B: Update the 0 estimatJ^ 0n+i = 0n — Gi, 

where 

'Note that this is the minimum variance unbiased estimate of 0. 
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is the Minimum Mean Squared Error (MMSE) estimate 
of £ji, thus, Pn+i is the appropriate Wiener coefficient; 

o 

^’"+1 P + cr2 (j'l + SNR’ 

(C) Decoding: 

At time N, Terminal B decodes the message using a 
minimum distance decoder for 0jv w.r.t. the PAM con¬ 
stellation. 

To calculate the error probability and rate attained by the 
S-K scheme, we note that £„+i = £„ — £„. Using the property 
dUl of analog transmission yields: 

2 _ ^ _ I'Q'. 

“ 1 + SNR “ SNR(1 + SNR)"' 

An important observation is that using (|9]l and the fact that the 
power of 0 is normalized to unity, one can regard the channel 
from 0 to 0Ar as an AWGN channel with a signal-to-noise 
ratio SNRat = namely: 


D. Joint Source-Channel Coding with Side Information 

A key observation made in this work is that while the trans¬ 
mission of £„ over the feedforward link (i.e. from Terminal 
A to Terminal B) can be regarded as a JSCC problem, the 
transmission of 0„ over the feedback link (i.e. from Terminal 
B to Terminal A) can be regarded as a JSCC problem with 
side information. More explicitly, at round n, Terminal B 
holds its current estimate 0 „ = 0 -f £„ and wants to convey 
it to Terminal A, while Terminal A knows 0 and can use 
it as side information. To exploit this, we employ a lattice- 
based JSCC scheme with side information based on a more 
general scheme by Kochman and Zamir 0. We note that for 
clarity of exposition and ease of analysis, we use the high- 
SNR version of 0 , which can be slightly suboptimal in the 
low-SNR regime. 

Let us now describe this solution. For simplicity, we start 
with the scalar case. First, we need some definitions and 
properties of modulo arithmetic. For a given d > 0, the scalar 
modulo-d function is 


SNRat = SNR •(1-fSNR)^"^ (10) 


Md [x] a; — d • round 


Note that it is possible to use the (biased) MMSE at the first 
round and compensate for the bias in the last round im, 
yielding an end-to-end signal-to-noise ratio of (l-fSNR)^ —1. 
However, this improvement is negligible and also complicates 
the analysis in the sequel. Plugging SNRjv into Q and 
bounding the Q-function by Q{x) < iexp(—gives: 


where the round(-) operator returns nearest integer to its 
argument (rounding up at half). The following properties are 
easily verified. 

Proposition 1: The following properties hold; 

(i) m 4 x]g 

(ii) The modulo distributive law 


f 3SNR-(l-f SNR)^-i\ 

[-2 -- ) ■ 

Plugging in the AWGN channel capacity ([T]i and removing the 
1 ” term, we obtain: 


Pe < exp 


{ 3 SNR 

2 1-l-SNR 



which is the well-known doubly exponential decay of the error 
probability of the S-K scheme. 

Let us now provide an alternative interpretation of the S- 
K scheme performance, in terms of the capacity gap attained 
after a hnite number of rounds. Plugging SNRat in Q yields: 


R> 




SNR • (1-f SNR)^-i \ 

T )' 


Substituting the resulting R in the definition of the capacity 
gap (|2]l and assuming SNR ^ 1 yields the following approx¬ 
imation for high SNR: 


pS-K 
^ dB 


iPe,N) 


ro.dB(Pe) 

N 


This behavior is depicted by the dashed curve in Fig. |4] 


Md [Md [x -I- di] d 2 — x] = Md [di ^ 2 ] 

(iii) if di -f d 2 G [-f, f), then 

Md [Md [x -C di] + d 2 - x] = di A d 2 - ( 11 ) 

otherwise, a modulo-aliasing error term of kd is added 
to the right-hand-side (fTTl l. for some integer k f 0. 

(iv) Let V ^ Uniform([—|, |)). Then Md[x-|-U] is uni¬ 
formly distributed over [—f, |) for any x G M. 

(V) E(Md [x + U])2 = f!. 

Using the above properties, we can provide the following 
solution to the JSCC with side information problem over the 
feedback link. Terminal B is in possession of 0 = 0 -f £ 
and wishes to convey it to Terminal A, where £ ^ 
and 0 is known to Terminal A. The terminals are connected 
through an AWGN channel: Y = X -\- Z, with the appropriate 
noise distribution Z ^ A/^(0, a^) and an input power constraint 
< P. Let V ^ Uniform([—|, |)) be a dither signal 
known at both terminals. Then, Terminal B transmits 


X = 


70 -f V 
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where we set d = \/i^ to guarantee that the power 
constraint is satisfied. Terminal A computes the estimate 


e= h 

7 


Y-^e-v 


Hence, by Proposition [T] property (|u]i: 


e = h 

7 


'ye + Z 


( 12 ) 


In the case where 'ye + Z G [—f, f) (Proposition [T] property 
(Iml ll we obtain 


e — e — Z. 

7 


(13) 


The question that arises at this point is how to set 7. Clearly, 
a large 7 would increase the probability of a modulo-aliasing 
error, but at the same time would reduce the additive estima¬ 
tion error in e. Denoting the modulo-aliasing error probability 
by Pmod- Proposition [T] property (Iml l implies that 

Pmod Pr (je + Z^ [-f, I 


Recalling that d = \/ 12P, and that e and Z are jointly 
Gaussian with known variances, we obtain 


Pmod — 2Q 


3P 


72 ct| + 


(14) 


Let us now introduce the looseness parameter L, defined as; 

P 


L = 


7 ^cr| -I- (T^ 

Using this definition, we can write pmod as 


(15) 


A. The Proposed Scheme 


In what follows, we assume that the terminals share a 
common random i.i.d sequence {Vn}n=i, mutually indepen¬ 
dent of the noise sequences and the messa ge, w here Vn 
Uniform([—|)). As before, we set d = s/ 12P. Recall the 
definition of 0„ and e„ in Subsection IIV-CI as the estimator 
of 0 and the corresponding estimation error at Terminal B at 
time n. 

A block diagram of the scheme is depicted in Fig. |2] Let us 
describe our scheme in detail. The scheme is given in terms 
of the parameters a,/)„, 7 „ which dictate the performance. 
The specific choice of these parameters is given in the next 
subsection. 

(A) Initialization: 

Terminal A: Map the message W to a PAM point 0. 
Terminal A =7 Terminal B: 

. Send Xi = VPQ 

• Receive Yi = Xi + Z\ 

Terminal B: Initialize the 0 estimate to 0i = 

(B) Iteration: 

Terminal B ^ Terminal A: 

• Given the 0 estimate 0„, compute and send 


Xn = Md 


'JnOn + Vn 


• Receive = A„ -f 

Terminal A: Extract a noisy version of estimation error 

^n- 


Pmod = 2Q 

Observe that a larger L implies a smaller variance of the 
modulo argument in (fTSl i. hence a smaller modulo-aliasing 
error probability. On the other hand, a larger L implies a 
smaller 7, and hence a larger estimation error by virtue of ( fOl l. 
In the sequel, it will be convenient to express our results in 
terms of L instead of 7, as the former is a more natural 
parameter of the problem. 

V. Simple Interaction 

In this section we present our simplicity-oriented interactive 
scheme, using the S-K scheme and the scalar modulo JSCC 
scheme with side information as building blocks. We analyze 
the associated capacity gap and discuss implementation issues. 
The scheme is presented in Subsection IV-AI An upper bound 
on the capacity gap attained by the scheme is given in Subsec¬ 
tion [WB] and proved in Subsection I V-CI Numerical results are 
presented in Subsection IV-DI Practical implementation issues 
are addressed in Subsection IV-El and a concluding discussion 
appears in Subsection IV-El 





In 


Yn - 7n0 - K 


7^^ti “f Zji 


(16) 

(17) 


Note that £« = Sn H— —Z^, unless a modulo-aliasing 

In 

error occurs. 

Terminal A ^ Terminal B: 


• Send a scaled version of en'- = a'yn'en, where 

a is set to satisfy input power constraint P (computed 
later). 

. Receive Y^+i = X^+i -f Z^+i 

Terminal B: Update the 0 estimate 0„+i = 0„ — en, 

where 


£n — /^n-t-l Vn-t-1 

The choice of fin is described in the sequel. 

(C) Decoding; 

At time N the receiver decodes the message using a 
minimum distance decoder for 0jv w.r.t. the PAM con¬ 
stellation. 
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Terminal A Terminal B 



Fig. 2. Block diagram of the proposed scheme 


B. Main Result: The Capacity Gap 

Recall the capacity gap function ro(*) of uncoded PAM 
given in ( 01 . Fix a desired number of rounds N, a target error 
probability S, and set the target modulo-error probability to 


2{N-1)' 

Set the looseness parameter to 



and set the scheme parameters a,l3n,jn to 


(18) 


(19) 


a = 



L- 


P 

P’ 


/3n = 


^n—1 


y^'g^SNR- (l - LSNR-1 


a 


1 + 4-3 ^SNR 


Define: 


^-1 

^2 


^3 


= Ip-^sL- ASNR”i 

def 1 


1 - L • SNR-t 



-r def 

W 4 = exp 


SNR-^3 ^(^ 1 ^ 2 ) 


N-i rx" 


( 21 ) 


( 22 ) 


Theorem 1: For the choice of parameters above, the inter¬ 
active communication scheme described in Subsection IV-AI 
achieves in N rounds an error probability Pe < S and a 
capacity gap F^g satisfying: 


nB{S,N) (23) 

< ;^ro,dB (I) + ('I'l.dB + ^2,dB) + 'I's.dB + ^4,dB, 


provided that 'I >4 > 1 (see Remark 0). The penalty term '^3 
is typically negligible (see Remark I?]). 

We prove this theorem is Subsection IV-CI 
Corollary 1 (High SNR behavior): Let ASNR and 5 be 
fixed. The capacity gap attained by our scheme for SNR large 
enough, can be approximated by 


In = 



( 20 ) 


where 


T'3SNR-^ 


^l-bSNR-T'-i 


1 - L • SNR-i 
1 -b T-gL • ASNR-^ 


1 —n 


rdB(<5,iV) 

«^ro.dB (1) + V [1 + ^ASNR-i]^g . (24) 

Note that the first term is roughly the capacity gap of the S- 
K scheme with noiseless feedback, and that the second term 
depends only on ASNR. 

The following remarks are in order. 

Remark 2: The penalty term T'l can be attributed to noise 
insertion from the feedback channel to the feedforward chan¬ 
nel. As shown below in ([30, our feedforward transmission can 











































be interpreted as analog transmission with an additional noise 
term emanating from the feedback channel. The aggregate 
noise variance grows from to causing a decrease 

in the signal-to-noise ratio, and a corresponding increase in 
the capacity gap. 

Remark 3: The penalty term '^2 can be attributed to power 
loss, which is a consequence of the noise insertion discussed 
above. Recall that Terminal A sends ajn^n which is a (scaled) 
noisy version of the true estimation error and where the 
noise stems from the feedback channel. Thus, part of the 
transmission power P is consumed by this noise, leaving only 
a power P/'i >2 < P for the description of £„. This in turn 
reduces the signal-to-noise ratio and correspondingly increases 
the capacity gap. 

Remark 4: The penalty term T'a can be attributed to the 
deviation in the feedforward transmission power caused by 
modulo-aliasing errors. It is important to note that = 

OdB for N < 3. Moreover, 'I'a dB is typically negligible for 
N > 3 and practical settings of S. This can be seen as follows: 
Recalling (fT^ and using the inverse form of the exponential 
bound for the Q function, Q~^{x) < \J ln(2a:), yields 


^*3 < 1 + 
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2{N -1) 


(^N- 1 



< 1 + 

“ 4 



Setting e.g. S = 10“® and = 10 in the above equation 
yields dB < 2 • 10“^dB, which is clearly negligible for all 
practical purposes. 

Remark 5: 4/4 is an additional penalty term, that results from 
the fact that we consider the capacity gap in terms of SNR 
ratios, whereas the explicit term arising from the capacity for¬ 
mula is related to log (1 -f SNR) rather than log (SNR). Note 
that Tfd^dB = O (SNR Due to simplifying assumptions 
taken in the bounding technique it is necessary to assume that 
'kd.dB > 0. The case where 4^4 dB < 0 corresponds to settings 
of very low SNR/ low ASNR/ very low 6 which is a less 
interesting regime. Note that in this case the capacity gap can 
still be calculated, but the bound is more cumbersome and is 
left out. 

Remark 6: Note that there is a “low SNR” regime (related 
also to the target error probability or to ASNR), where the 
loss terms 'ki^dB + 'k 2 ,dB are larger than say To^dB (d). In 
that case, setting N = \, namely using an uncoded system 
with no interaction, is the optimal choice of parameters for 
our scheme. 


C. Proof of Theorem [7] 

In Subsection IIV-CI we analyzed the error probability of the 
S-K scheme with noiseless feedback, relying on the fact that 
all the noises are jointly Gaussian, including the noise en 
experienced by the PAM decoder. To that end, we were able 
to directly use the error probability analysis of simple PAM 
over an AWGN channel discussed in Subsection IIV-AI 

In the noisy feedback case however, the non-linearity of the 
modulo operations at both terminals induces a non-Gaussian 
distribution of em- An analysis of the decoding error based 
on the actual distribution of en seems involved. Yet, an 
upper bound can be derived via a simple coupling argument 
described below. 

Recall that Terminal A computes En, a noisy version of the 
estimation etTor at Terminal B, via a modulo operation (fTbl l. 
For any nS A—1} we define as the event where 

this computation results in a modulo-aliasing error, i.e., 

En = {lneu + Zni[-i,i)}. (25) 

Furthermore, we define Em to be the PAM decoding etTor 
event: 

En = {sn i [-%^,%^)}, 

where dmin is the minimal distance of the PAM. The error 

def 

probability of our scheme is clearly = Pt:{En)- However, 
as mentioned above, the distribution of Em is not Gaussian 
due to the nonlinearity introduced by the modulo operations, 
which in turn renders difficult to compute. To circumvent 
this problem, we instead consider the following trivial upper 
bound 



We intuitively expect this bound to be rather tight, since a 
modulo-aliasing etTor is very likely to imply a PAM decoding 
error. It turns out that upper bounding the right-hand-side 
above is not too difficult, as we now show. 

To proceed, we define the coupled system as a system that is 
fed by the exact same message, and experiences the (sample- 
path) exact same noises. The coupled system differs from 
the original system in two aspects: first, it does not apply 
any modulo operations at either of the terminals; second, its 
feedforward power constraint is set to be P' = P/ 4'3 (where 
we appropriately define SNR' SNR/ 4 ' 3 ). We denote all 
the signals and events in the coupled system using the same 
notation as in the original system, but with an additional prime 
(') symbol, unless the signals are always identical between the 
systems by construction (e.g. Z„). Clearly, the coupled system 
can violate the power constraint at Terminal B. However, given 
the message W, all the random variables in the coupled system 
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are jointly Gaussian, and in particular, the estimation errors 
in that system are Gaussian for n = 1,... ,N. Moreover, 
it is easy to see that given no modulo-aliasing has occurred 
up to time n, the estimation errors are sample-path identical 
between the systems, i.e. = £«■ This leads to the following 
lemma: 

Lemma 1: For any > 1: 

Pr(^UE„j =Pr(^UE;^. 

Proof: Define the event 

n 

Jn = n ^ 

i=l 

Let us show by induction that Jn = J^- For n = 1, we have 

Ji = {71^1 + Zi £ |)} 

= {7ie'i + ^ie (27) 

= J[ 

where dZTl l follows from the sample path identity. Assuming 
Jk-i = J'k-i using the sample path identity again, we 
have 

Jk = {jk^k + Zk £ J f)} G Jk -1 
= {jks'k + Zk £ I))} n Jl_i 
= J'k 

By the exact same argument (replacing £ with 0 we clearly 
have that J„_i (7 En = J'n-i G E'^. Thus we can write 

C N \ N /n-l 

IJ £;„ = Pr(L;i) + ^ Pr fl n 

n—1 / n—2 Vi—1 

N 

= Pr( Ji) + ^ Pr {Jn-i G En) 

n—2 
N 

= pr(j[) + ^pr(j;_in£;;) 

Combining the above with (l26l l and applying the union 
bound in the coupled system, we obtain 

N 

Pe<J2Pr{E'n). (28) 

n—1 

Thus, we can now upper bound the error probability by 
calculating probabilities in the coupled system, which involves 
only scalar Gaussian densities and significantly simplifies the 
analysis. 

Let us begin by calculating the scheme parameters, and 
then use them to calculate the capacity gap. As mentioned 


in Subsection IV-BI the target error probability is set to 6. Let 
us set Pt{E'j,) = |, and Pr(L;() = • • ■ = Pi{E'n_,) = 
where is given in (fTSl ). and calculate the corresponding 
L. Recalling the definition of the event E'n in (l25T l and that 
d = Vl2P, and since 7 „£^ -f Zn is Gaussian, we have that 

p'n,=2Q( J . 

\\Eijne'n + Zn)y 

Using the definition of L in (fTSl) . where here = 

E(£^)^ yields 

+ Zn)^ = 7n'^n ~ "L ' 

First note that solving this equation for 7 „ yields ( l20b . Also 
note that combining the above two equations gives us the 
following setting of L: 



With L in hand, we can easily choose a so that the power 
constraint at Terminal A is satisfied with equality. Namely 

P' = E(X;+i)"=E(a7„£„)2 

= q;^E( 7„£„)^ = a^E(7„£^ + Zn)'^. 

From (|2^ it follows that: 



The parameter /3„ determines the evolution of the estimation 
error. The linear estimate of £^: ^ = jdn+iYn+i, is the 
optimal estimate in the coupled system, in which and 
are jointly Gaussian. We would thus like to minimize 
E (£^ — £^)^. Recalling the input-output relation of the feed¬ 
forward channel and using (l30l l we obtain 

Yn+l = \J{inS'n + Zj^ -£ Zn+l ( 31 ) 

and solving the optimization for /3„ yields: 


(T 1 + SNR' 

Noting that e'n+i = e'n —fn computing the MMSE for the 
optimal choice of /3„+i above, we obtain a recursive formula 
for Gn and SNR„: 


y/sNR' • ( 


1 - LSNR- 


1 -f SNR' 


SNR„ = X 


= SNR' • 1 -f SNR' • 


= SNR' • (1 -f SNR' • T'iT'2) 


1 - L • SNR-i 
1 -f T-aL • ASNR- 

n — 1 


(32) 
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It is possible to give a different and modular interpretation 
for (|3^ . Let us rewrite (ISTl l: 


Y' — 



(33) 


This equation can be regarded as a JSCC problem designated 
for the transmission of over an AWGN channel. The effec¬ 
tive noise of this AWGN channel is + Zn+i- Some 

algebra shows that the variance of this noise is where 'Pi 
is defined in (l 2 Tl l. We call this phenomenon noise insertion, 
as previously mentioned in Remark |2] A consequence of this 
phenomenon is that part of the transmission power is now 
consumed by the noise element related to Subtracting this 
penalty from P' shows that the part of transmission power used 
for the description of is P' We call this phenomena 
power loss as previously mentioned in Remark [3] So, all in 
all, the SNR' of the channel describe by (l33T l is • SNR'. 
Using this fact together with the SNR evolution of the S-K 
scheme (O, and noting that the noise insertion and power 
loss effects only occur after the second round, we obtain (l32]) . 

We are now left with calculating the capacity gap. Let us 
find the condition that guarantees our scheme operates within 
the target error probability requirement. Rewriting (l28l l we get 


N 


Pe 




< 2 (^iv) 




(34) 


hence by setting 


Q 


SSNRat 
- 1 


(35) 


we obtain that Pf, < 6 as desired. We are now in a position 
to derive a lower bound on the capacity gap attained by 
our scheme: we can rearrange (l35] l to obtain a lower bound 
on R, use the expression (l32l i for SNRjv, and plug this 
into the definition of the capacity gap (|2]l. This yields (l23t . 
where 'I' 4 ,dB is a remainder term obtained by pedestrian 
manipulations using the inequality — ln(l — x) < for 
X S (0,1). Note that the result was obtained for the specific 
choice p'^ = 2 (n-i) of modulo-aliasing error. In general, 
reducing p^ increases L which in turn decreases SNRjv, and 
hence increases the second addend on the right-hand-side of 
(l34l i. resulting in a trade-off that could potentially be further 
optimized. 


We are now left with specifying the relation between P' 
and P, captured by 4 / 3 . We start by defining the following 
sequence of events: 

A„ = {X„ = X'J forn=l,...,W 

Using this definition, we can write 

E(X^) = Pr(A„)E {Xl I A„) + Pr(:4;)E {X^ \ A^) 

E«)2 = Pr(A„)E {iX'„f I A„) + Pr(A;)E {{X'^f \ A^) . 


Combining these expressions, while noting that E(X^)^ = P' 
and that E | A„) = E | A„), we obtain: 

E(X2) = P' + Pr(:4;) • E - {X'„f \ ^ 

<P' + Pr(^) ■E{X^\ a;) (36) 

Let us now calculate (l3^ separately for n < 3 and n > 3. For 
n = 1 we have Xi = by construction, hence Pr(Ai) = 0 
and thus E(Xi)2 = P'. For n = 2 we have by (fTTI) : 


Xn = aMrf 


7i£i -I- Zi 


— Ct ^7l£l + ■ 


By definition of the scalar modulo operation it holds that 
[x] I < |a;|, and therefore \Xn\ < |Ar^|. , which implies 
that E(X„)^ < P'. We are now left with the more general 
case of n > 3. We have that 


E 


{^n I 2ln) < E 7ri£n + ^ | 


d 

= I a- 


= i L 


p' \/i^Y 

p 2 j 


= 3LP' 


Plugging the above in (l36b yields 

E{X^f < P' {l + 3Pv(A)L) 

We are left with bounding Pr(A„) for n > 3. To that end, note 
that Xn ^ X'^ implies that at least one modulo error occurred 
up to time instant n — 1, with probability one. Therefore: 

C n-l \ n-1 

U - X! = (n - l)p^ 

2^1 / 
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Fig. 3. The capacity gap as function of the iterations and ASNR for a target Fig. 4. The capacity gap as function of the iterations and ASNR for a target 
rate R = 1 (low SNR), and target error probability pt = 10“® rate R > 4 (high SNR), and target error probability pt = 10“® 


Collecting the upper bounds on E(X^) for all n = 1,iV, 
we obtain the following upper bound on the average power 
consumed by our scheme: 

1 ^ 
n—1 

p' ( ^ 

^^[2 + ^(1 + 3L(n - l)p^) 

\ rt=3 

P' , 3(7V-2)(7V+1)^ , 

= lV r +-2- 

Hence, setting P' = P/'l'a where is given by (l22li obeys 
the feedforward transmission power constraint. This concludes 
the proof. 

D. Numerical Results 

The behavior of the capacity gap for our scheme as a func¬ 
tion of the number of interaction rounds and ASNR is depicted 
in Fig. |3] and Fig. |4] for high SNR and low SNR setups. In 
both figures we plotted the capacity gap, for a target rate R 
and a target error probability pe = 10“®, where the SNR 
is found by numerically solving R = -^ log + ro^,^ 2 ) ) 
and the capacity gap is calculated by definition. It can be 
seen that the higher the ASNR, the smaller the capacity gap, 
where at ASNR = 30dB we virtually obtain the noiseless 
feedback performance. The points marked Uopt are those for 
which the capacity gap is less than 0.2dB above the minimal 
value attained. In Fig.|3]the rate was set to i? = 1, and it can be 
seen that ASNR = lOdB reduces the capacity gap to 4.2dB 
in 12 iterations, and ASNR = 20dB reduces the capacity gap 
to l.ldB in 22 iterations. In Fig. IH the rate was set to i? = 4, 


and it can be seen that ASNR = lOdB reduces the capacity 
gap to 3.5dB in 11 iterations, and ASNR = 20dB reduces the 
capacity gap to O.SdB in 19 iterations. Observing (l24ll we can 
see that for high SNR the result is only a function of ASNR, 
thus does not depend on the target rate or the base SNR. 

E. Notes on Implementation 

The scheme described in this section is both simple and 
practical, as opposed to its noiseless feedback counterparts that 
break down in the presence of feedback noise. This provides 
impetus for further discussing implementation related aspects. 
The following conditions should be met for our results to carry 
merit: 

1) Information asymmetry: Terminal A has substantially more 
information to convey than Terminal B. 

2) SNR asymmetry: The SNR of the feedforward channel is 
lower than the SNR of the feedback channel. This can 
happen due to differences in power constraints (e.g. when 
Terminal A is battery operated and Terminal B is connected 
to the power grid), path losses, or noise/interference asym¬ 
metry. 

3) Complexity/delay constraints: There are severe complexity 
or delay constraint at Terminal A. 

4) Two-way signaling: Our scheme assumes sample-wise 
feedback. The communication system should therefore be 
full duplex where both terminals have virtually the same 
signaling rate; hence, the terminals split the bandwidth 
between them even though only Terminal A is transmitting 
information. This situation can sometimes be inherent to 
the system, but should otherwise be tested against the (non¬ 
interactive) solution where the entire bandwidth is allo¬ 
cated to Terminal A. This choice of forward vs. feedback 
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bandwidth allocation yields a system trade-off that is SNR 
dependent; Terminal A can use our scheme and achieve 
a rate of (^(SNRjiB — (where C'(-) is the Shannon 
capacity function), or alternatively employ non-interactive 
codes over the full forward-feedback bandwidth. The latter 
option doubles the forward signaling rate but also incurs 
a 3dB loss in SNR and a potentially larger capacity gap 
r^g, resulting in a rate of 2C'(SNRdB — 3dB — T^g). It 
can therefore be seen that our solution is generally better 
for low enough SNR. For instance, for 6 = 10“® and 
ASNR > 30dB our scheme outperforms (with comparable 
complexity and delay) full bandwidth uncoded PAM for 
any SNR < 23dB, and outperforms (with significantly 
smaller complexity and delay) full bandwidth non-feedback 
codes with T^g = 3dB for any SNR < 9dB. 

5) Bandwidth splitting: We have tacitly assumed that the 
bandwidth is equally split between the feedforward and 
feedback channels. We note that a non-equal splitting of 
the bandwidth can also be handled. Standard techniques 
for trading bandwidth with SNR can be used in order to 
facilitate the use of our scheme in these scenarios. 

The use of very large PAM constellations, whose size is 
exponential in the product of rate and interaction rounds, 
seemingly requires extremely low noise and distortion at the 
digital and analog circuits in Terminal A. This may appear to 
impose a major implementation obstacle. Fortunately, this is 
not the case. The full resolution implied by the constellation 
size is by construction confined only to the original message 
0 and the final estimate ©^v; the transmitted and received 
signals in the course of interaction can be safely quantized 
at a resolution determined only by the channel noise (and not 
the final estimation noise), as in commonplace communication 
systems. Figuratively speaking, the source bits are gradually 
revealed along the interaction process, where the number of 
bits revealed in every round is determined by the channel SNR. 

Another important implementation issue is sensitivity to 
model assumptions. We have successfully verified in sim¬ 
ulation the robustness of the proposed scheme in several 
reasonable scenarios including correlative noise, excess quanti¬ 
zation noise, and multiplicative channel estimation noise. The 
universality of the scheme and its performance for a wider 
range of models remains to be further investigated. 

F. Discussion 

Note that so far we have limited our discussion to the PAM 
symbol. The bit-error rate is in fact lower, since an error in 
PAM decoding affects only a single bit with high probability 
assuming Gray labeling. However, note that the modulo¬ 
aliasing error will typically result in many erroneous bits, and 
hence optimizing the bit error rate does not yield a major 


improvement over its upper bound. Further fine-tuning of the 
scheme can be obtained by non-uniform power allocation over 
interaction rounds in both Terminal A and B; in particular, we 
note that Terminal B is silent in the last round, which can be 
trivially exploited. 

We note again that for any choice of SNR and ASNR, the 
error probability attained by our scheme cannot be made to 
vanish with the number interaction rounds while maintaining 
a non-zero rate, as in the noiseless feedback S-K scheme case. 
The reason is that dEll implies that L < SNR, which in 
turn by (fT4l) imposes a lower bound on the attainable error 
probability, dictated by the probability of modulo-aliasing of 
the feedback noise. Equivalently, one cannot get arbitrarily 
close to capacity for a given target error probability, since 
increasing the number of iterations improves SNRat and 
reduces the PAM decoding error term, but at the same time 
increases the modulo-aliasing error term in O^ . Hence, our 
scheme is not capacity achieving in the usual sense. However, 
it can get close to capacity in the sense of reducing the capacity 
gap using a very short block length, typically A Ri 20 in the 
examples presented. To the best of our knowledge, state-of- 
the-art (non-interactive) block codes require a block length 
typically larger by two orders of magnitude to reach the same 
gap at the same error probability. Consequently, the encoding 
delay of our scheme is markedly lower than that of these 
competing schemes. Alternatively, compared to a minimal 
delay uncoded system under the same error probability, our 
scheme operates at a much lower capacity gap for a wide 
regime of settings, and hence can be significantly more power 
efficient. 

Another important issue is that of encoding and decoding 
complexity. Our proposed scheme applies only two multi¬ 
plications and one modulo operation at each terminal in 
each interaction round. This is significantly lower than the 
encoding/decoding complexity of good block codes, even if 
non-optimal methods such as iterative decoding are employed. 

VI. Improving Reliability 

In this section we describe two asymptotic versions of 
our simple scheme that are aimed at improving reliability at 
the cost of increased complexity and delay. These scheme 
are shown to outperform their non-feedback counterparts in 
the error exponent sense, when ASNR is large enough. The 
first scheme is based on replacing the scalar PAM signaling 
with a random block code, and replacing the scalar modulo 
operation with a modulo-lattice operation. The error analysis 
of both the modulo error and the decoder decision error is 
performed in the coupled system as before, but are now 
concerned with error exponents instead of the scalar Q- 
function. The second scheme is based on concatenated coding 
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with the simple interaction scheme as an inner code and a 
random block outer code. Classical error exponent results are 
given in Subsection IVI-AI JSCC with lattices is discussed in 
Subsection IVI-BI The first scheme is introduced in Subsec¬ 
tion IVl-CI A lower bound on its error exponent is given in 
Subsection IVI-Dl and proved in Subsection IVl-EI The second 
scheme is introduce Subsection IVI-FI A concluding discussion 
appears in Subsection IVI-GI 


A. Block Codes and Error Exponents 

In what follows, we replace the scalar PAM mapping of 
the message point W ^ Q with an AWGN channel block 
code mapping kP —0 of length N. In this subsection we 
cite the classic results on the performance of block-codes 
for the AWGN channel. For channel coding over the AWGN 
channel with a signal-to-noise ratio SNR and rate R, there 
exist block codes of length N whose average error probability 
(averaged over the messages) under maximum likelihood de¬ 
coding is exponentially upper bounded by Pt{W(Y) ^ W) < 
g-AfBrlSNR.tt) Yvjjere Er{SNK, R) is given by jh): 


{ EspiSNR, R) if Rrc<R<C 
Erc{Sl<lR, R) if Rex <R< Rrc 
F;ex(SNR,i?) iff)<R<Rex 


The boundaries between the regions are as follows. The 
Shannon capacity is (7 = ^ log(l -l- SNR). The critical 
rate is Rcr ^/2 log ^ 1/2 -|- Snr ^4 _|_ 1/2 y'1 -|- SNR^/ 4 ^. The 

expurgation rate is Rex 1/2 log {^/2 + ^/ 2 yJ\ + SNR^/ 4 ^. 
The exponents in the above three regions are given by; 

E,p{S^R, R) = ^-^(^P + l-[P- l)^l+sj4|3Ty) 

+ 1 In - SNR03 1) ^ SNR^(^-1)') (37) 

where /3 = 2 ^^, 


Erc{SNR,R) = 1-/3 + 


1 


SNR 1 


+-log [/3- 


SNR\ 

^ J 


- - log(/3) - log(2)R 
where here /3 = and 

Eex{S^R,R) = ^ fl - \/l - 2-2« 


B. JSCC with Side Information Using High Dimensional 
Lattices 

From this point on, boldface letters such as X, 0 de¬ 
note vectors of size N\. As shown in Subsection IIV-DI the 
probability of modulo-aliasing error in the JSCC problem 
with side information in its scalar version, while sometimes 
small, is bounded away from zero. In order to make this 
probability arbitrarily small, the dimension of the scheme 
should be increased. This can be achieved by introducing 
large dimensional lattices and replacing the scalar modulo with 
the corresponding modulo-lattice operations. Let us start by 
quickly surveying a few basic lattice notation and properties 

ED: 

(i) A lattice of dimension Xa is denoted A = G ■ 
where G is the generating matrix. 

(ii) Vol(A) = I det(G)| is the lattice cell volume. 

(iii) The nearest neighbor quantization of x w.r.t. A is denoted 
Qa N- 

(iv) Vq = {x : Qa [a:] = 0} is the fundamental Voronoi cell 
pertaining to A. 

def 

(v) The modulo A operation is Ma [a;] = x — Qa [a:]. 

(vi) Ma [•] satisfies the distributive law 

Ma [Ma [a;] -p y] = Ma [a; -I- y] 

(vii) The volume to noise ratio (VNR) of a lattice in the 
presence of AWGN with variance is p(A) 
[Vol(A)]"/^^ /(t2. 

def 

(viii) The normalized second moment of a lattice A is G(A) = 
(t2(A)/[Vo 1(A)]^/^^, where a^{A) = ^E(||y||2) and 
V is uniformly distributed on Vq. 

Consider again the JSCC problem introduced in Subsec¬ 
tion |IWDl where now Terminal B is in possession of a vector 
0 = 0 -f £, and wants to convey the i.i.d. ^ A/"(0, cr^) 
etTor vector e to Terminal A which is in possession of 0, 
over an AWGN channel Y = X + Z. The channel is again 
characterized by Z ~ A/’(0 ,ct^) and < P. We assume 
that a dither signal V ^ Uniform (Vo), mutually independent 
of the message and the channel noises, is known at both 
terminals. 

Let us revise the JSCC with side information scheme 
presented in Subsection IIV-DI replacing the scalar modulo 
operation M^ [•] with the lattice modulo operation Ma [•]. 
Hence, Terminal B transmits 


It is also well known and readily verified that for 0 < i? < 
G, the exponent Esp{SNR, R) coincides with the asymptotic 
expression of Shannon’s sphere packing bound for the AWGN 
channel ll20l . Hence, i5sp(SNR, R) is also an upper bound for 
the reliability function, and is tight above the critical rate. 


X = 


y@ + V 


Terminal A estimates: 


e = —Ma 
7 


X - 70 - y 


ye + Z 
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And if 7£ + G Vo then 

e = £ H —Z 

7 

We are now ready to set the parameters of the modulo- 
lattice scheme. Let us set the lattice’s second moment to 
equal the feedback power constraint cr^ (A) = P. This would 
guarantee (due to dithering) that the feedback transmission 
power constraint is satisfied. The modulo-aliasing error event 
is the event where 

7£ -f .Z ^ Vo 


Recall the definition of the looseness parameter L of the lattice 
in (O, and note that L = /r(A) • G'(A). It was shown in ll^ 
Theorem 5] that there exist lattices that asymptotically attain 
G(A) = 2 ^ + o(l), and a modulo-error that is exponentially 
bounded by pmod < g-AfABp( ali), where Ep{x) is the 
Poltyrev exponent, given by ED 


i (a: — 1 — ln(a;)) if 1 < a: < 2 

Ep{x) = ^ ^ (ln(a:) -f ln(|)) if 2 < a; < 4 

X if a; > 4 


Hence in our notation, such lattices satisfy 


Pmod ^ 


-Nj^EpiL) 


(38) 


Clearly, in a similar fashion to the scalar scheme, the 
setting of L determines the trade-off between modulo-error 
probability and decision error probability. Setting L close 
to 1 will maximize the effective signal-to-noise ratio (and 
maximize the block code error exponent), but at the same 
time minimize the modulo-error exponent. Setting L to be 
large will do the opposite, and will also reduce the maximal 
achievable rate. We note that the corresponding lattice-based 
JSCC scheme in ||9l is better at low SNR, due to the addition 
of another Wiener coefficient multiplier at the receiver before 
the modulo operation, which results in non-Gaussian statistics 
of the error. For simplicity of analysis, this technique is not 
used here. 


C. Description of the Scheme 

In this subsection, we show how to combine the blockwise 
coding and blockwise modulo operations into one scheme. In 
a nutshell, the message W is mapped into a codeword © of 
length Na, and sent in the first block (Na channel uses). 
This replaces the PAM transmission in the scalar scheme. 
In the sequel, vector analog transmission is used over the 
feedforward, and vector modulo-lattice transmission is used 
over the feedback. Ultimately, W is decoded using a maxi¬ 
mum likelihood decoding rule. Since under this protocol both 
terminals are idle half the time, we interlace two identical 
schemes, encoding and decoding two independent messages. 


S-K Rounds : K x 2 



Fig. 5. Blockwise transmission. The time instants are divided into blocks 
of size N\. Single headed arrows and denote transmission from 
Terminal A to Terminal B respectfully. Double headed errors , and 
bear the same meaning but for the second scheme. 


as illustrated in Fig. |5] We denote the block index (or round 
index) by k G AT}. For brevity, and with a mild abuse 

of notation, we only describe the evolution of one of the 
interlaced schemes. 

The setting of the parameters a,fik,lk will be discussed 
in the sequel. The dither variables are i.i.d., uniformly 
distributed on Vo, and mutually independent of the message 
and the noise processes. 

(A) Initialization: 

Terminal A: Map the message W to codeword 0 using 
a codebook for the AWGN channel with average power 
P. 

Terminal A => Terminal B: 


• Send Xi = 0 

• Receive Yi = JCi + Zi 

Terminal B: Initialize the 0 estimate to ©i =Yi. 


(B) Iteration: 

Terminal B =7 Terminal A: 

• Given the 0 estimate 0^, compute and send in the 
following block 




7fe®fc + y k 


• Receive Yk = Xk + Zk 

Terminal A: Extract a noisy scaled version of the 
estimation error vector ek- 



Yk — 'Ik® — Vk 


Note that ek = £k + unless a modulo-aliasing 

error occurs. 

Terminal A ^ Terminal B: 

• Send a scaled version of £fe: Xk+i = a-jk^k, where 
a is set so that the input power constraint P is met. 
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. Receive Yk+i = Xk+i + Zu+i 

Terminal B: Update the © estimate 0fe+i = 0^ — Sk, 

where 

Sk = Pk+lY k+l 

(C) Decoding: After the reception of block K the receiver 
decodes the message W{@k) using an ML decision rule 
w.r.t. the codebook. 


D. Main Result: The Error Exponent 
Set the scheme parameters a^Pk^lk to 



Pk 


CT/c-l 

a 



LSNR-i) 


1 + SNR 



where cr^ = ak{L) = and 

SNRk{L) = SNR (l + SNR - 4~AfNR-i > (39) 


The following theorem provides a lower bound on the error 
exponent obtained by our scheme. 

Theorem 2: For the choice of parameters above, the interac¬ 
tive communication scheme described in Subsection IVI-CI with 
a total delay of N time instants, attains an error probability 
Pe < where 


EpB (R) 


def 

= max 
KgN,L>1 


mm{Er{SNRK{L),KR),Ep{L)}\ 

/ ■ 
(40) 


E. Proof of Theorem |2] 

Dehne the error event 

Ek = {i?(©k) ^ lu} 

The error probability of each of the interlaced schemes is 
Pe = Pr{EK), and hence the total error probability is upper 
bounded by 2pe- Therefore, below we analyze only a single 
scheme, since the 2 factor does not change the exponential 
behavior. As in the analysis of the scalar scheme, the channel 
© —YK not Gaussian due to the non-linear modulo 
operations, which complicates a direct analysis. In order to 
circumvent this, we will upper bound the error probability by 
further taking modulo-error events into account, and working 
in the couples system as before. These errors events are defined 
by 

Ek = |7fc£fe E Zk ^ Vo I 


and we have that 

Pe ([J Ek 

\k=l 

Applying the coupling argument of Lemma [T] we can obtain 

Using the union bound we obtain 

K 

Pe < ^Pr(L;0- 
fc=i 

Calculating the above probabilities now involves only Gaus¬ 
sian random vectors, which significantly simplifies the analy¬ 
sis. 

From this point on, we perform an asymptotic exponential 
analysis. We set the parameters such that all modulo-aliasing 
error probabilities are equal. Hence 

K 

Pe < ^Pr(£;^) < max(^jnod,Pdec) 
k^l 

where without loss of asymptotic optimality we have set all 

def 

the modulo-error probabilities to be equal pmod = Pr(ii^^), 

def 

and also defined pdec = Pi'(^'r-)- The modulo-aliasing eiTor 
can be exponentially upper bounded by the Poltyrev exponent 
( 1^ . i.e. ptnod < 

We observe that the channel 0 —&k is in fact equivalent 
(in the coupled system) to Na parallel independent AWGN 
channels each with the same noise variance cr^, and with a 
signal-to-noise ratio SNRif(L) given in (l39l) . We can now 
encode the message W into 0 using a Gaussian codebook of 
block length Na and rate KR to obtain 

Fdec < 

Note that the rate KR is chosen such that the overall rate 
over K rounds is R. Balancing the exponents for pdec and 
Pmod yields the result. The division by 2K is due to the use 
of two interlaced schemes, that doubles the overall delay. 

The trade-off is now clear: setting the lattice looseness L 
to be large reduces pmod but also reduces SNRr'(L) hence 
increasing pdec^ und vice versa. Due to the monotonicity of 
EriSiSSRx(L), KR), Ep{L) in L, a numerical solution to (l40l i 
can be easily found. 

Remark 7: It should be noted that as in the simple interaction 
case, the feedforward transmission power should be reduced 
by a suitable factor vka in order to avoid excess power 
due to modulo-aliasing errors. However, for high-dimensional 
lattices, the probability of a modulo-aliasing error is exponen¬ 
tially small in the dimension of the lattice, and the magnitude 
of the vectors in the fundamental Voronoi cell Vo is upper 
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bounded by P ll22l Section VIII]. Therefore, the power back¬ 
off term Tia is asymptotically negligible and does not affect 
the statement of the theorem. 


F. Concatenated Coding with Simple Interaction 


In this subsection we introduce a concatenated coding 
scheme based on the simple interaction scheme of Section m 
used in conjunction with a (non-interactive) block code. The 
simple interaction scheme is used as an in inner code, induc¬ 
ing a DMC whose input and output correspond to a PAM 
constellation, and a suitable block code is employed as an 
outer code over that induced DMC. If the transition matrix 
pertaining to the induced DMC had been known, the calcu¬ 
lation of the error exponent would have been straightforward. 
Unfortunately, computing this transition matrix appears to be 
extremely involved. Nevertheless, there is one property of 
the transition matrix that is already available to us from the 
preceding discussion - an upper bound on the error probability, 
namely the probability that the output differs from the input. 
Therefore, to circumvent the difficulty in computing the error 
exponent of the induced DMC, we bound it from below by 
computing the error exponent corresponding to a “worst case” 
symmetric DMC, that has an error probability equal to (the 
upper bound on) the error probability induced by our inner 
coding scheme. This leads to the following result: 

Theorem 3: The concatenated coding scheme described 
above, with total delay of N time instants, attains an error 
probability pe < where 


inax 

5e(o,i),i<'eN 


f M, KR) j 


(41) 


M = 
SNRk 


A 


IS given in (1^21 ) and 


is the size of the PAM constellation. 


M, R) = max M, R), M,R)} . 


The random coding error exponent is given by 

E^^‘^{6,M,R) = 


max ■|^—pln(2)i?-|-pln(M) — 

(1 -I- p) In ^(1 — (5)T+p + {M — l)TT?(5T+p^ j- 
and the expurgated error exponent is given by 
E^^^iS,M,R) = 

sup < — pln(2)i?-b pln(M) 
p>i 


- pin ( 1 + (M - 1) (2yiiir + ^«''' 


(42) 


(43) 


Let P denote an M x M transition matrix of a DMC, where 
P {i I j) is the probability that the output is j given that the 
input is i. The maximal error probability of P is defined as 

maxerr(P) 1 — minP(i | i) 

i 

We say that P is totally symmetric, if 


P (* I j) 


1 — (5 if i = j 

if * 7^ j 


For some <5. Clearly, for such P it holds that maxerr(P) = 6. 

The proof of the theorem is based on the observation that a 
totally symmetric DMC has the smallest uniform input random 
coding / expurgated error exponent for a given maximal error 
probability, together with straightforward calculations of the 
classic error exponents for this channel. Specifically, denote 
the random coding error exponent at rate R for a DMC 
with transition matrix P and uniform input distribution by 
Er{R,P). The corresponding expurgated error exponent is 
denoted by Eex{R, P)- The worst case property of the totally 
symmetric DMC is stated in the following lemma, which is 
proved in Appendix lAl 

Lemma 2: Let P he. a M x M DMC with maxerr(P) < 5, 
and let Psym be a totally symmetric M x M DMC with 
maxerr(Psym) = Then Er{R,Psym) < Er{R,P) and 

Eex{R, Psym) < Eex{R,P)- 

Proof of Theorem |5} Let P be the transition matrix per¬ 
taining to the M X M DMC induced by our simple interaction 
scheme, with some target error probability 5 and K interaction 
rounds (the explicit derivation of the scheme parameters and 
the error analysis are given in Section IVTi. This DMC satisfies 
maxerr(P) < <5. Let Pgym denote the transition matrix of a 
totally symmetric AT x M DMC with maxerr(Psym) = 
Lemma |2] implies that the error exponent attained by an 
optimal outer block code is lower bounded by the random 
coding / expurgated exponents of Pgym, where the latter follow 
from a straightforward calculation and are given by (l42l i and 
(l43l l. Finally, dril l follows from the appropriate normalization 
in the inner code delay K, and by an optimization over the 
target error probability and the number of interaction rounds. 


G. Discussion 

Numerical evaluations of Egp, Er, Pfb and P™*' for 
SNR = 20dB are depicted in Fig.|6] It can be clearly seen that 
for low rates is larger than Pfb for both value of ASNR. 
What is less visible in Fig. |6] but is clearer in Fig. |7] is that 
at high rates Pfb exceeds Pp^*“. Comparing these achievable 
exponent to the sphere packing bound, which upper bounds the 
best achievable error exponent without feedback, it is evident 
that both error exponents are smaller than the sphere packing 
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23dB. We can see that dramatically improves over the 
non-feedback achievable error exponent at rates close to zero, 
but then falls below it at rates above 0.46(7 (where C is the 
channel capacity). Comparing to i?™*" we can see that 
the latter is better at rates higher than 0.18(7. Finally, Eps 
is the best among the three feedback error exponent at rates 
above 0.53(7, and also beats the sphere packing bound at rates 
up to 0.9(7. 

It is interesting to note that in ifT^ . the authors characterize 
the “shut-off rate” of their scheme, namely as the rate 
above which their exponent does not improve the no-feedback 
one. This rate is given by Lemma 8 ]: 

\ log (^1 + 2SNR. ASNR • - 7 o)) 


Fig. 6. Error exponents with and without feedback for SNR = 20dB with 
ASNR = lOdB and ASNR = 20dB 


6- 

cd 

2 ; 





C 

O 

Gh 

X 

W 


W 



R/C 


where 70 G [ 0 , 1 ] is a root of a quadratic equation given in 
their Lemma 6 (note that a factor ^ is missing in the original 
expression for Rfg)- Dividing the above by the capacity, it is 
easy to show that 

(7 - 

1 / 1 -f log(l -f ASNR) (1 -f SNR) log e \ 

2 V log SNR 2ASNR-SNR2 log SNR ) 

For SNR ^ 1 and a fixed ASNR, the above upper bound 
clearly converges to i, hence in this high-SNR regime the 
error exponent of does not yield any improvement over 
the non-feedback exponent for rates above half the capacity. 

It is also instructive to compare the error exponents dis¬ 
cussed above at zero rate. The zero-rate exponent attained by 
the scheme in ifT^ is given in Lemma 8 therein; 


Fig. 7. Fig.ia at high rates 


Eg^{R = 0) 


SNR 

4 


(^1 -t- ASNR • 


SNR \ 
1 -f SNR ) 


(44) 


bound for ASNR = lOdB, and the maximal between them is 
greater than the sphere packing bound for ASNR = 20dB. 

It is of interest to compare the error exponents of Theorems 
in and [3 to the noisy feedback error exponent of lfT2l . which 
employs a concatenated linear coding scheme. In this scheme 
the output of the inner code is a linear combination of the 
message variable and noisy versions of the previous channel 
outputs (i.e. passive feedback), and the outer code is a random 
block code. Due to the linear nature of the inner code, its 
output is regarded as a single use of an AWGN channel, 
and the scheme parameters are judiciously chosen to try 
and maximize its SNR. Then, the error exponent of the 
concatenated scheme is readily calculated using the standard 
AWGN error exponents. 

Fig. HJcompares our error exponents to that of Ha, which is 
denoted by Egg". In this setting SNR = lOdB and ASNR = 


It can be verified by direct calculation that at zero rate, the 
error exponent of our concatenated scheme outperforms that of 
the lattice-based scheme. The former is given in the following 
formula at high-SNR, derived in Appendix iBl 

£;™^(i? = 0)> (45) 

I ASNR (e-^SNR - l) - ln(2 [ln(3ASNR)J - 1) 
4[ln(3ASNR)J 

It is easy to see that in the limit of SNR —00 (l44l i is largers 
than (l45l l. 
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Fig. 8. A comparison of non-feedback en'or exponents, our en'or exponents 
{Efb-, and Chance and Love’s error exponent (Epg). In this setting 

SNR = lOdB and ASNR = 23dB 


and 


EexiR, P) = 

max{-pln(2)i? + 2pln(M) - p\nFex{p, P)} , (47) 
p>i 


where 


and 


M-l 


Frip,P)= 

i-0 


'M-l 


E I 




i+p 


F,,ip,P)^^Yl 


k.i 


E/p(7Ffc)PcrR) 


Now, let n denote the set of all possible permutations over 
{1,..., M}. For a given channel transition matrix P, and a 
given permutation tt S Ft, we define the channel transition 
matrix P^r to be 


P.(j|*)=P(^(j)k(*))- 


Appendix A 
Proof Lemma[2] 

The proof is based on a symmetrization argument in con¬ 
junction with a convexity argument. We begin by recalling the 
error exponents for a DMC with transition matrix P, input 
distribution Q and rate R El. The random coding exponent is 
given by 


E,{R,Q,P) = 


max 

0<P<1 


M-l 


—pln(2)i? — In E] 


j—0 L k—O 


M-l 


n i+p^ 


E Qik)PU I kp 


and the expurgated error exponent is given by 


Eex{R,Q,P) 


= max 
p>i 


pln(2)i? 


- p\n'^Q{k)Q(i) 

k,i 


E vpu I k)Pij I *) 



Below we restrict our attention to a uniform input distribu¬ 
tion Q{i) = -E for 0 < i < M — 1, which can be suboptimal 
but facilitates a much simpler analysis. We denote the random 
coding and expurgated exponents corresponding to a uniform 
input by Er{R,P) and Eex{R,P) respectively. These are 
explicitly given by 


Er{R, P) = 

max {—pln(2)i? -f (1 -f p) ln(M) — lnFj.(p, P)} , (46) 


The following lemma easily follows from (l46l l and WT\ by 
incorporating the permutation into the order of summations. 
Lemma 3: Er{R,PTr) = Er{R,P) and EgxiRjPir) = 

EexiR, P) for 71 G n. 

We further define the symmetrized channel matrix Psym 
associated with P, to be the one obtained by uniformly 
averaging over all possible permutations, i.e., 

Psymij I *) = E I *)■ 

I I Tren 

We now have the following lemma. 

Lemma 4: Psym is a totally symmetric DMC with 
maxerr(Psym) < maxerr(P). 

Proof: Choose some indices i,j,k,£, such that if i = j 
then also k = £. Let cr G If be the permutation where (T(i) = k, 
aij) = £, and ct(x) = x otherwise. 

Psymij I *) = E I ^(*)) 

I I Tren 

= E ° I ° 

' ' Tren 

= M ^ P(7r(fc) I 7r(f)) 

' ' Tren 

= Psymik I £)■ 

Hence Psym is a totally symmetric DMC. Specifically, for any 

i 

Psymii I *) = E I ^(*)) 

I I Tren 

= mE E At It) 

k 7T£'n.,7z{i)—k 

= 4^pwt). 

k 
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Thus, 

maxerriPsym) = 1 “ X! 

k 

< 1 — ininP(fc|fc) 

k 

= maxerr(P). 


We now proceed to show that the (uniform input) error 
exponents associated with Pgym can be used as lower bounds 
for those of P. This follows from a concavity argument 
combined with Lemma [3] 

Lcmmd 5. Pgy^'j ^ Ej-i^R^P^ and E^xi^R^Psym) ’E 

Eex{R, P)- 

Proof: We begin by showing that both Pr(p, P) (in (l46l l) 
and Fex (p, P) (in (|47]i) are concave in P for all valid values 
of p. We start with Pr(p, P) and recall its definition; 


Fr{p,P) 


M-1 

E 


'M-l 


E I 


fc=0 


1+p 


We note that I k)~ is a concave function in the 

vector P{j I •); this follows by observing that this function is 
a norm of order € [ 5 ,1] over K.:^^ (and the values of P 
are strictly positive by design). Er(p, P) is thus concave in P 
as well, being a sum of concave functions. 

Similarly, we recall the definition of Fex {p, P) 


Fex{p,P)=Y. 

k,i 




_i 

p 


We note that \/P{j \ k)P{j \ i) is a geometric mean, which is 
concave in the vector (P(j | k), P{j \ i)) £ for any fixed 
k. Thus, E, ^/P{J I k)P {,3 I z) is concave as well, being a 
sum of concave functions. The - power of this sum is concave 

since it is a scalar composition of a concave function with 

1 

a concave nondecreasing function xp (for p > 1). Finally, 
Fex (p, P) is a sum of concave functions hence is also concave. 

With concavity in hand, we can readily apply Jensen’s 
inequality 


Pr{p, Psym) —Fr P-Trj 

>T^\T.^r{p,P.) (48) 

I I Ti-en 

=Fr{p,P) (49) 


where (l48T l follows from concavity via Jensen’s inequality, 
and (l49l l is by virtue of Lemma. 0 Plugging the resulting 
inequality in (l46l l yields Er{R,Psym) < Fr{R,P). Fol¬ 
lowing the exact same steps for Fex we can prove that 

Fex{R,Psynv)<Fex{R,P). ■ 


Let us now calculate the error exponents for Psym- Since 
by Lemma [ 3 ] we have that Psym is a totally symmetric DMC, 
and denoting S maxen:{Psym), we can use (l46l l and (l47l) to 
obtain 


Fr{R: Psym') — mUX s 


0<p<l 


(1-f p) ln(^(l 


pln(2)P-f pln(M)- (50) 
- + {M - l)TT?i5TT^'j I 


and 

Eex{R, Psym) = sup \ - p\u{2) R + p\u{M)- (51) 

P>1 [ 

Lemma [3 also tells us that S < maxerrP. It can be 
verified (by direct differentiation) that both (fSOl l and (ISTI) 
are monotonically decreasing in S for S £ (0, ^). Therefore, 
replacing 6 with any upper bound on maxerr(P) still results 
in a lower bound for both Er{R, Psym) and Eex{R, Psym), 
concluding the proof. 


Appendix B 

Zero Rate Analysis for Theorem[3] 

Below we provide a lower bound for the error exponent of 
Theorem |3] at R = 0. We use the expurgated error exponent 
which is known to be larger at P = 0, and use M = 2 in (|4^ 
which yields 

P™C(^,M = 2,P = 0) = 

sup < pln(2) — pin 

p>i L 

It can be shown (see ||6] Probelm 5.24]) that for every input 
distribution, the zero rate error exponent is optimized by taking 
the limit p —oo, which in the case of uniform input results 
in: 

M = 2, P = 0) = In (2 

= -l\n{2S{l-6)) 

>-iln(2,5) (52) 

Let us now find a simple setting for the scheme’s param¬ 
eters. Applying the union bound it is clear that the error 
probability of the inner code is upper bounded by S, where 

d = Q (x/SNR^f) + 2{K - 1)Q (ySLj , (53) 

where the first addend is due to the probability of error at the 
PAM (here, BPSK) decoder, and the second addend is due 
to the modulo-aliasing error probabilities. In order to simplify 
the solution, we impose equality between the two Q-functions, 


(^1+(2^5(1^)'^) I. 
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which results in SNR/^ = 3L. Using (l32l i and some algebra, 
we obtain 


SNR 


/ASNR(1 + SNR) 
ASNR + L 


K-l 

= 3L. 


(54) 


This equation is difficult to solve. To simplify, we note that 
solution of the following equation for L: 

gives a lower bound on the corresponding solution of (l54l i. 
yielding 


L = 

- ASNR + 3"^ ASNRi-^ (SNR(1 + SNR)^-^) ^(55) 

It is instructive to use a high-SNR approximation 1 + SNR ra 
SNR and compute an approximate (yet attainable) version of 
(|55]l: 

L = ASNR ((SASNR)--^ SNR - l) 

Plugging this into (l5^ and using the exponential bound for 
the Q-function: Q{x) < iexp(—we get 

5<{2K- l)Q(^) < exp 

Now, plugging the above into the error exponent bound (l52T i 
and recalling the concatenated coding exponent (ItIT i. results 
in 


= 0 ) 


> max • 
K&i 


= max ■ 
Km 


|L - ln(2R: - 1) 
4R 


fASNR ((3ASNR)“^ SNR - l) - ln(2R: - 1) 


AK 


Let us now approximately optimize for K, by taking the 
derivative to zero in a simplified expression 

(3ASNR)-y 

K 

yielding K* = [ln(3ASNR)J and (3ASNR)-i^ > 
(3ASNR)~ in{3ASNR) = g-1 finally obtaining the following 
achievable error exponent; 


(56) 

fASNR (e-^SNR - l) - ln(2 [ln(3ASNR)J - 1) 
4[ln(3ASNR)J 

Let us now evaluate = 0) for SNR >> 1 when 

ASNR is held fixed. In this case (l56l l can be approximately 
expressed as 


l;™^(r = o) 


> 


3ASNR 


8e[ln(3ASNR)J 


SNR. 


(57) 


Note that by dJTl) the sphere packing bound at i? = 0 is 
fSNR rendering (|57] | better for ASNR^b ^ 10.4dB. It is 
interesting to note that in this setting (l44l) still yields a larger 
value. 
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